Mining the Blogosphere to Generate Cuisine Hotspot Maps

نویسندگان

  • Chia Chun Shih
  • Ting-Chun Peng
  • Wei Shen Lai
چکیده

AbstrAct: Choosing a restaurant is one of the most frequent decisions faced in modern daily life; however, it is difficult for consumers to choose between food/restaurant by reading large amounts of reviews. This study attempts to generate cuisine hotspot maps through blog content mining to help consumers make restaurant decisions by specialties. The main obstacle in doing this involves recognizing and extracting restaurants and essentialrestaurant information (i.e., restaurant dishes) in unstructured content. In contrast to traditional Named Entity Recognition (NER) targets, dish name is a promising target that received little attention in previous studies. This study develops methods for recognizing and extracting restaurant names and dish names from review postsin the blogosphere and achieves satisfactory performance. Based on the method, we processed more than 12,000 Chinese blog posts and generated a cuisine hotspot map. The map shows the most popular dishes of restaurants in a map-view to help consumers make restaurant decisions. A prototype of cuisine hotspot map, named CuisineGuide, is implemented and available as an iPhone application.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Hydrograph Modeling Using SGSim: A Case Study of Behbahan Aquifer, Southwest of Iran

Hydrograph modeling and prediction of groundwater levels are the main concerns of most hydrogeological calculations and water resource management process. The present study is an application of Sequential Gaussian Simulation (SGSim) method for predicting groundwater levels using recorded monthly data (180 months) related to 21 piezometers of Behbahan aquifer, southwest of Iran. To generate real...

متن کامل

Modeling and Data Mining in Blogosphere

This book offers a comprehensive overview of the various concepts and research issues about blogs or weblogs. It introduces techniques and approaches, tools and applications, and evaluation methodologies with examples and case studies. Blogs allow people to express their thoughts, voice their opinions, and share their experiences and ideas. Blogs also facilitate interactions among individuals c...

متن کامل

Genetic Crossovers Are Predicted Accurately by the Computed Human Recombination Map

Hotspots of meiotic recombination can change rapidly over time. This instability and the reported high level of inter-individual variation in meiotic recombination puts in question the accuracy of the calculated hotspot map, which is based on the summation of past genetic crossovers. To estimate the accuracy of the computed recombination rate map, we have mapped genetic crossovers to a median r...

متن کامل

Compass: A hybrid method for clinical and biobank data mining

We describe a new method for identification of confident associations within large clinical data sets. The method is a hybrid of two existing methods; Self-Organizing Maps and Association Mining. We utilize Self-Organizing Maps as the initial step to reduce the search space, and then apply Association Mining in order to find association rules. We demonstrate that this procedure has a number of ...

متن کامل

Self-organizing maps for latent semantic analysis of free-form text in support of public policy analysis

The huge amount of free-form unstructured text in the blogosphere, its increasing rate of production, and its shrinking window of relevance, present serious challenges to the public policy analyst who seeks to take public opinion into account. Most of the tools which address this problem use XML tagging and other Web 3.0 approaches, which do not address the actual content of blog posts and the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • JDIM

دوره 8  شماره 

صفحات  -

تاریخ انتشار 2010